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Expanded Classification of Hepatitis C Virus Into 7 
Genotypes and 67 Subtypes: Updated Criteria and 
Genotype Assignment Web Resource 

Donald B. Smith, 1 Jens Bukh, 2 Carla Kuiken, 3 A. Scott Muerhoff, 4 Charles M. Rice, 5 
Jack T. Stapleton, 6 and Peter Simmonds 1 

The 2005 consensus proposal for the classification of hepatitis C virus (HCV) presented 
an agreed and uniform nomenclature for HCV variants and the criteria for their assign- 
ment into genotypes and subtypes. Since its publication, the available dataset of HCV 
sequences has vasdy expanded through advancement in nucleotide sequencing technolo- 
gies and an increasing focus on the role of HCV genetic variation in disease and treatment 
outcomes. The current study represents a major update to the previous consensus HCV 
classification, incorporating additional sequence information derived from over 1,300 
(near-) complete genome sequences of HCV available on public databases in May 2013. 
Analysis resolved several nomenclature conflicts between genotype designations and using 
consensus criteria created a classification of HCV into seven confirmed genotypes and 67 
subtypes. There are 21 additional complete coding region sequences of unassigned sub- 
type. The study additionally describes the development of a Web resource hosted by the 
International Committee for Taxonomy of Viruses (ICTV) that maintains and 
regularly updates tables of reference isolates, accession numbers, and annotated align- 
ments (http://talk.ictvonline.org/links/hcv/hcv-classification.htm). The Flaviviridae Study 
Group urges those who need to check or propose new genotypes or subtypes of HCV to 
contact the Study Group in advance of publication to avoid nomenclature conflicts 
appearing in the literature. While the criteria for assigning genotypes and subtypes remain 
unchanged from previous consensus proposals, changes are proposed in the assignment of 
provisional subtypes, subtype numbering beyond "w," and the nomenclature of intergeno- 
typic recombinant. Conclusion: This study represents an important reference point for the 
consensus classification of HCV variants that will be of value to researchers working in 
clinical and basic science fields. (Hepatology 2014;59:318-327) 



Soon after the publication of the first nearly com- 
plete genome sequence of hepatitis C virus 
(HCV) in 1 989, 1 it became apparent that isolates 
from different individuals or countries showed substan- 
tial genetic diversity. After much research and survey- 
ing by groups worldwide, this variation was 
summarized and variants assigned as genotypes and 
subtypes in a consensus classification and nomenclature 



system and formal rules were agreed for the assign- 
ment and naming of future variants. Genotype and 
subtype assignments required: (1) one or more com- 
plete coding region sequence(s); (2) at least three epi- 
demiologically unrelated isolates; (3) a phylogenetic 
group distinct from previously described sequences; (4) 
exclusion of intergenotypic or intersubtypic recombina- 
tion, whether the components were classified or not. 
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The application of these criteria confirmed the assign- 
ment of six distinct genotypes, comprising 18 subtypes. In 
addition, 58 subtypes were provisionally assigned pending 
the availability of a complete coding region sequence or 
additional isolates. This agreement on nomenclature was 
mirrored by the establishment of several curated databases 
that organized HCV sequences as they became available 
and indicated which genotypes and subtypes were con- 
firmed or provisionally assigned (Los Alamos HCV 
Sequence Database, 3 euHCVdb, 4 Hepatitis Virus Data- 
base: http://s2as02.genes.nig.ac.jp/). Concurrently, a pro- 
posal was made to unify the numbering of HCV with 
reference to the genotype la isolate H77 (AF009606). 5 

Recently, this remarkable agreement and cooperation 
in HC>V nomenclature has been complicated by several 
developments. None of the HCV sequence databases are 
now actively curated and responsibility for naming new 
genotypes and subtypes has reverted de facto to individ- 
ual researchers. This, combined with publication delays, 
has created new contradictions in which isolates assigned 
to the same subtype (4b: FJ462435, FJ025855, 
FJ025856, and FJ025854; 6k: DQ278891 and 
DQ278893; 6u: EU408330, EU408331, and 
EU408332) belong to different subtypes according to the 
consensus criteria. 2 Another challenge is that the number 
of complete coding region sequences has increased from 
238 in 2005 to more than 1,300. Similarly, the number 
of variants matching the criteria for assignment as con- 
firmed genotypes/sub types has expanded from 18 to 67; 
several recent publications contain figures that are illegi- 
ble with regard to isolate name and/or accession num- 
ber, 6 ' 10 complicating subsequent comparisons. 

Finally, advances in sequencing technology have 
accelerated the rate at which HCV sequences are gen- 
erated. Recent articles have reported the partial sequen- 
ces of 282 isolates from Vietnam 11 and 393 isolates 
from China, 10 in each case identifying additional sub- 
types of genotype 6. Technological advances have also 
made it easier to obtain HCV complete coding region 
sequences through both dideoxysequencing and pyrose- 
quencing. The latter technique was recently used to 
obtain 31 complete coding region sequences belonging 



to 13 different subtypes. More than 225,000 HCV 
sequences are now available on GenBank and about 
30,000 added every year. This volume of sequence 
information and the diversity of known HCV variants 
make it increasingly important for researchers to have 
a single curated resource to refer to for accurate sub- 
type designations, reference genomes and alignments. 

This article updates the genotype and subtype assign- 
ments 2 ' 7 and the nomenclature rules, and describes the 
establishment of a reference Website hosted by the Inter- 
national Committee for the Taxonomy of Viruses 
(ICTV) to validate new genotype and subtype assign- 
ments, and provide updated reference alignments. 

Revision of Confirmed Genotypes and 
Subtypes 

Unique HCV complete or nearly complete coding 
region sequences available on NCBI Genome (969 sequen- 
ces, http://www.ncbi.nlm.nih.gov/genome) and the Los 
Alamos HCV sequence database (1,364 sequences > 8,000 
nt from http://hcv.lanl.gov/content/index) were aligned 
within SSEvl.l 12 using Muscle v3.8.31 13 and refined 
manually. Phylogenetic analysis of sequences containing 
>95% of the coding region reveals seven major phyloge- 
netic groupings corresponding to genotypes 1-7 (Fig. 1). 
Within these genotypes, grouping of the constituent sub- 
types is supported by 100% of bootstrap replications. 

Based on the consensus criteria, 2 confirmed subtypes 
(indicated by a letter following the genotype) require a 
complete or nearly complete coding region sequence 
differing from other sequences by at least 15% of 
nucleotide positions and sequence information from at 
least two other isolates in core/El (>90% of the 
sequence corresponding to positions 869 to 1,292 of 
the H77 reference sequence [accession number 
AF009606] numbered according to reference 5 ) and 
NS5B (>90% of positions 8,276 to 8,615) (Table 1). 
The use of a 15% threshold over the complete coding 
region is supported by analysis of the large number of 
potential subtypes now sequenced (Fig. 2). This reveals 
major and consistently placed gaps in the distribution 
of pairwise distances between and within subtypes of 
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Fig. 1. Phylogenetic tree of 129 representative complete coding region sequences. Up to two representatives of each confirmed genotype/sub- 
type were aligned (together with a third extreme variant of subtypes 4g and 6e) and a neighbor joining tree constructed using maximum compos- 
ite likelihood nucleotide distances between coding regions using MEGA5. 83 Sequences were chosen to illustrate the maximum diversity within a 
subtype. Tips are labeled by accession number and subtype (*unassigned subtype). For genotypes 1, 2, 3, 4, and 6, the lowest common branch 
shared by all subtypes and supported by 100% of bootstrap replicates (n = 1,000) is indicated by •. 



each genotype as follows: genotype 1: 12.9%-17.0%, 
genotype 2: 13.1%- 17.6%, genotype 3: 12.5%-19.6%, 
genotype 4: 12.7%-15.3% (except distances of 14% 
and 14.2% between JX227963 and two subtype 4g 
sequences), and genotype 6: 9.9%-l4.9% (except dis- 
tances of 13.1%-13.7% between EU246931 and three 
subtype 6e sequences). Hence, for all genotypes and 
with remarkably few exceptions, a clear division can be 
made between isolates that differ by <13% over their 
complete coding region sequences (members of the 



same subtype) and those that differ by >15% (differ- 
ent genotypes or subtypes). This analysis includes 
sequences distinct from any of the confirmed HCV 
subtypes but not currently represented by three or 
more independent isolates that remain unclassified 
subtypes (Table 2). Whether the exceptions noted are 
due to technical problems or to differing epidemiologi- 
cal histories is unknown. 

The seven confirmed genotypes (discussed below) 
comprise 67 confirmed subtypes, 20 provisionally 
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Table 1. Confirmed HCV Genotypes/Subtypes 



TABLE 1. Continued 



Genotype* 


Locus/lsolate(s) 


Accession number(s) 


Reference(s) 


Genotype 1 








la 


HPCPLYPRE, HPCCGAA 


M62321, M67463 


29,30 


lb 


HPCJCG, HPCHUMR 


D90208, M58335 


31,32 


lc 


HPCCGS, AY051292 


D14853, AY051292 


33 


le 


148636 


KC248194 


9 


lg 


1804 


AM9 10652 


34 


lh 


EBW443, EBW9 


KC248198, KC248199 


9 


11 


136142, EBW424 


KC248193, KC248197 


9 


Genotype 2 








2a 


HPCPOLR JFH-1 


D00944, AB047639 


35,36 


2b 


HPCJ8G, JPUT971017 


D10988, AB030907 


37,38 


2c 


BEBE1 


D50409 


39 


2d 


QC259 


JF735114 


40 


2e 


QC64 


JF735120 


40 


21 


D54 


DQ155561 


41 


2j 


C1799, QC232 


HM777358 JF735113 


6,40 


2k 


VAT96 


AB031663 


42 


2m 


QC178, BID-G1314 


JF735111, JX227967 


40,8 


2q 


963, 852 


FN666428, FN666429 


43 


2r 


QC283 


JF735115 


40 


Genotype 3 








3a 


HPCEGS, HPCK3A 


D17763, D28917 


44,45 


3b 


HPCFG 


D49374 


46 


3g 


BID-G1243, QC260 


JX227954, JF735123 


8,21 


3h 


QC29 


JF735121 


21 


31 


IND-HCV, BID-G1244 


FJ407092, JX227955 


8 


3k 


HPCJK049E1, QC105 


D63821, JF735122 


47,21 


Genotype 4 








4a 


ED43 


Y11604 


48 


4b 


QC264 


FJ462435 


16 


4c 


QC381 


FJ462436 


16 


4d 


03-18, QC382 


DQ418786, FJ462437 


49,16 


4f 


IFBT88, PS6 


EF589161, EU392175 


50,51 


4g 


QC193 


FJ462432 


16 


4k 


PS3, QC383 


EU392173, FJ462438 


51,16 


41 


QC274 


FJ839870 


16 


4m 


QC249 


FJ462433 


16 


4n 


QC97 


FJ462441 


16 


4o 


QC93 


FJ462440 


16 


4p 


QC139 


FJ462431 


16 


4q 


QC262 


FJ462434 


16 


4r 


QC384 


FJ462439 


16 


4t 


QC155 


FJ839869 


16 


4v 


CYHCV073, BID-G1248 


HQ537009, JX227959 


52,8 


4w* 


P212, P245 


FJ025855, FJ025856 


14 


Genotype 5 








5a 


EUH1480, SA13 § 


Y13184, AF064490 


53,54 


Genotype 6 








6a 


EUHK2,6a33 


Y12083, AY859526 


55,56 


6b 


Th580 


D84262 


57 


6c 


Th846 


EF424629 


58 


6d 


VN235 


D84263 


57 


6e 


GX004 


DQ314805 


59 


6f 


C-0044 


DQ835760 


60 


6g 


HPCJK046E2 


D63822 


47 


6h 


VN004 


D84265 


57 


61 


Th602 


DQ835770 


60 


61 


Th553 


DQ835769 


60 


6k 


VN405 


D84264 


57 


61 


537796 


EF424628 


58 


6m 


B4/92 


DQ835767 


60 


6n 


KM42, D86/93 


DQ278894, DQ835768 


17,60 


6o 


QC227 


EF424627 


58 



Genotype* Locus/lsolate(s) 



Accession number(s) 



Reference(s) 



6p 
6q 
6r 
6s 
6t 
6u 
6v 
6w 
6xa 

Genotype 7 
7a 



QC216 

QC99 

QC245 

QC66 

VT21, D49 

D83 

NK46, KMN-02 
GZ52557, D140 
DH012, DH028 

QC69 



EF424626 
EF424625 
EU408328 
EU408329 

EF632071, EU246939 
EU246940 

EU158186, EU798760 
DQ278892, EU643834 
EU408330, EU408332 

EF108306 



58 
58 
61 
61 

62,19 
19 

62,63 
17,64 
18 



Additions and changes from assignments proposed in shown in bold. 

*Consensus proposed genotype/subtype names. Where multiple sequences 
of a HCV genotype are available, two sequences have been listed, prioritized by 
(a) publication date or (b) submission date when unpublished. 

f Locus (or isolate name if locus is the same as the accession number). 

^Previously described as 4b. 7,14 

Sequence obtained from acute phase plasma of a chimpanzee experimen- 
tally infected with (human-derived) isolate SA13. 
"Previously described as 6u. 18 



assigned subtypes, and 21 unassigned subtypes. These 
tables have been posted on the ICTV Website at 
http://talk.ictvonline.org/links/hcv/hcv-classification.htm 
and will be updated regularly by the authors with 
information shared across existing HCV databases 
(http://hcv.lanl.gov/; http://euhcvdb.ibcp.fr/euHCVdb/), 
typing tools, and other resources (e.g., http://www.bio- 
africa.net/rega-genotype/html/subtypinghcv.html; 
http://comet.retrovirology.lu/; http://hcv.lanl.gov/con- 
tent/sequence/phyloplace/; http://s2as02.genes.nig.ac. 
jp/; http://www.viprbrc.org/). Alignments including 
representatives of these subtypes are available on the 



-Genotype 1 
-Genotype 2 

Genotype 3 
-Genotype 4 
-Genotype 6 

Genotypes 1-7 



p-distance 

Fig. 2. Distribution of p-distances between complete coding region 
sequences. The frequency of p-distances was calculated within and 
between genotypes using SSE. 12 Intra-genotype pairwise distances 
were calculated for all available complete coding region sequences 
except for subtypes la, lb, and 2b where 20 random sequences were 
used. For p-distances >0.15 (equivalent to a percent difference of 
15%), frequencies were scaled to reduce the maximum frequency to 
less than 300. Distances between genotypes were calculated using 
one or two representatives of each confirmed and unassigned subtype, 
with the frequencies scaled as above. 
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Table 2. Unassigned Complete Coding Region Sequences 



Genotype* 


Locus/lsolate(s)' 


Accession no(s) 


Reference 


Genotype 1 






65 


1_AJ851228 


AJ851228 


AJ851228 


1_KC248195 


160526 


KC248195 


9 


1_ HQ537007 


CYHCV025 


HQ537007 


52 


Genotype 2 








2JF735119 


QC331 


JF735119 


40 


2JF735112 


QC182 


JF735112 


40 


2JF735110 


QC114 


JF735110 


40 


2JF735117 


QC297 


JF735117 


40 


2JF735116 


QC289 


JF735116 


40 


2JF735118 


QC302 


JF735118 


40 


Genotype 3 








3JF735124 


QC115 


JF735124 


21 


Genotype 4 






8 


4JX227964 


B1D-G1253 


JX227964 


4_FJ025854* 


P026 


FJ025854 


14 


Genotype 6 








6_DQ278891 § 


KM45.KM41 


DQ278891.DQ278893 


17 


6JX183550 


QC273 


JX183550 


20 


6JX183552 


TV476 


JX183552 


20 


6JX183549 


KM35 


JX183549 


20 


6JX183551 


TV257 


JX183551 


20 


6JX183553 


TV533 


JX183553 


20 


6JX183554 


L349 


JX183554 


20 


6JX183557 


DH027 


JX183557 


20 


6JX183558 


QC271 


JX183558 


20 



♦Classification of sequences into genotypes but without subtype assignments 
using the format "genotype_Accession number." 
t Locus (or isolate name if locus is the same as the accession number). 
^Previously described as 4b. 14 
^Previously described as 6k. 17 



ICTV Website and at http://hcv.lanl.gov/content/se- 
quence/NEWALIGN/align.html/. 

The process of producing these tables has detected a 
small number of variants with conflicting assignments. 
Isolates P026, P212, P245, (FJ025854-6) are described 
as subtype 4b, 14 but these complete coding region 
sequences show <85% identity to the core/El of iso- 
late Zl (U10235, L16677), provisionally assigned as 
4b 15 that is more closely related to core/El of the 
complete coding region sequence of isolate QC264 
(FJ462435 16 ). P212 and P245 belong to the same, 
novel subtype for which NS5B sequence is available 
from a third isolate (P213, GU049362), so this 
becomes confirmed subtype 4w. Isolate P026 differs 
from all other genotype 4 sequences by > 17.5% but 
being represented by a single sequence remains cur- 
rently unassigned (Table 2). 

Similarly isolates KM45 and KM41 (DQ278891,3) 
have been assigned to subtype 6k, 17 but differ by 
>17% in complete coding region sequence from the 
subtype 6k isolate VN405 (D84264) and 6.7% from 
each other, and so remain an unclassified subtype of 
genotype 6. Two distinct groups of isolates have been 
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assigned to subtype 6u; EU408330-2 18 and 
EU246940. 19 The latter was submitted first to Gen- 
Bank and is represented by NS5B sequences from two 
additional isolates and so is assigned subtype 6u, while 
EU408330, EU408331, and EU408332 are designated 
subtype 6xa (see below). 

Finally, our analysis of both phylogenetic groupings 
and sequence distances suggests that a number of iso- 
lates 20 described in their GenBank accessions as 
"subtype k-related" (QC273, TV257, TV476, KM35), 
"subtype 1-related" (TV533, L349), "intermediate 
between subtypes 6m and 6n" (DH027), or 
"intermediate between subtypes 6j and 6i" (QC271) 
should be considered as unassigned novel subtypes. 

Additional Taxonomic Levels 

In making this taxonomic distinction into virus 
genotypes and subtypes we are aware of the difficulties 
of imposing a discrete classification scheme on a com- 
plex taxonomy. In particular, for genotypes 3 and 6 
there are undoubtedly several hierarchies of taxonomic 
relationships. For example, subtypes 6k and 61 form a 
clade along with several unassigned genotype 6 iso- 
lates. 20 A higher-level clade includes these sequences 
and subtypes 6m and 6n, while a further grouping 
consists of these subtypes and subtypes 6i and 6j (Fig. 
1). These phylogenetic hierarchies are reflected in the 
discontinuous distribution of p-distances between com- 
plete coding region sequences (Fig. 2), which com- 
prises three almost merging distributions (roughly 
15% to 20%, 20% to 25%, and 25% to 30%). Three 
distributions of intersubtype distances were also 
observed for genotype 3 (20% to 25%, 25% to 27%, 
and 27% to 30%), two distributions for genotype 2 
(18% to 22.5%, 23% to 26.5%), and uniform distri- 
butions for genotype 1 (17.7% to 25-4%) and geno- 
type 4 (15.3% to 23.1%). However, the internal 
divisions defined by the multiple distributions of dis- 
tances within genotypes 2, 3, and 6 have not been 
shown to correspond with geographical or epidemio- 
logical differences. The higher-level grouping of sub- 
types 3b, 3g, and 3i does not reflect a common 
geographical origin distinct from that of 3h and 3k. 
There is also no geographical correlation with the 
groupings of subtypes 6k, 61, and various unassigned 
isolates; for 6m, 6n, and an unassigned isolate; for 6h, 
6i, 6j, and an unassigned isolate; for 6a and 6b; for 6f 
and 6r; or for 6r and 6e. 22 Similarly, there are cur- 
rently no known virological or clinical reasons to rec- 
ognize these higher-level groupings. Without practical 
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utility, we therefore propose that the observed within- 
genotype hierarchies are not given any formal recogni- 
tion in their nomenclature. 

Proposed Updates and Changes to Rules 
for Genotype/Subtype Assignments 

Subtype Names. By definition, subtype name 
assignments would be limited to a maximum of 26 if 
designated by a single letter suffix (e.g., 2a-2z). We 
therefore suggest that subtypes are assigned up to the 
letter V and subsequent designations follow the 
eXtended form xa, xb, ... xz, in turn followed by ya, ... 
yz, za, ... zz, potentially giving a total of 101 subtypes 
of each genotype. This avoids potentially ambiguous 
terms such as "subtype 6x," which could be interpreted 
as "genotype 6 of unknown subtype," or designations 
such as "subtype 3aa," which might suggest a relation- 
ship with 3a. 

Provisional Genotypes. According to the 2005 
consensus classification protocol 2 new genotypes could 
be provisionally assigned from a single complete cod- 
ing region sequence, but partial or complete coding 
region sequences from additional isolates would be 
required to confirm these assignments. Since then only 
one provisional genotype has been identified (7a) rep- 
resented by a single isolate (QC69, EF 108306). Thus, 
in contrast to subtype assignments, the number of 
genotypes appears relatively limited and the require- 
ment to sequence multiple isolates now seems over- 
onerous. We propose that only a single complete 
coding region sequence is needed to confirm a new 
genotype assignment; QC69 is therefore confirmed as 
genotype 7a. 

Provisional Subtypes. The 2005 consensus proto- 
col also proposed that provisional subtypes could be 
assigned on the basis of sequence comparisons in the 
core/El and NS5B regions for at least three independ- 
ent isolates, requiring in addition a complete coding 
region sequence before being confirmed. Of the 58 
subtypes provisionally assigned in the 2005 article, 38 
have now been confirmed (Table 1). However, it is 
now much easier to obtain complete coding region 
sequences and very few additional provisional subtypes 
have been proposed. Instead, some authors have incon- 
sistently labeled unusual isolates with the suffix "?," 
"unassigned group I" 11 ' 23 or "subtype 1(1). " 9 We pro- 
pose that provisional subtype designations should no 
longer be provided for variants where complete 
genome sequences are lacking. The 20 remaining pro- 
visionally assigned subtypes will be maintained (Table 
3), since they already exist in the literature. Future 



subtype assignments will only be made (as confirmed 
assignments) when sequence data from three or more 
isolates including at least one complete or nearly com- 
plete coding region is provided. Where a complete 
coding region sequence is available but there are fewer 
than three isolates, we propose that these remain unas- 
signed. In Table 2 these are labeled using the form 
"Genotype_Accession number," e.g., 1_AJ851228. 

Recombinant and Other Forms. One issue that 
was not addressed in the 2005 consensus protocol 2 
was the naming of the newly discovered recombinant 
forms of HCV, their importance being unknown. 
Nine different recombinant forms of HCV have now 
been described (Table 4), of which only one (2k/ lb) is 
represented by multiple isolates; no multiple recombi- 
nants have been reported (reviewed in reference 24 ). In 
this context it does not seem necessary to revise the 
nomenclature generally used in the literature in which 
"RF" (recombinant form) is followed by the contribu- 
tory subtypes separated by "/" in the order in which 
they appear in the complete genome sequence. We 
suggest that recombinant forms with the same geno- 
typic structure but with different breakpoints or where 
the component genomic sections are unrelated are 
numbered consecutively with a numerical suffix (for 
example, RF2b/lb_l). 

Proposals for New Genotype/Subtype Assign- 
ments. The ICTV Flaviviridae Study Group is willing 
to take a coordinating role in the assignment of newly 
described variants of HCV. We urge researchers who 
have characterized new HCV variants that potentially 
qualify as new types or subtypes to contact Donald 
Smith (D.B.Smith@ed.ac.uk) or any member of the 
Study group (listed on http://ictvonline.org/subcomm 
ittee.asp?committee=25&se=5) in confidence before 
publication so that naming conflicts can be avoided 
and appropriate assignments made. 

Future Developments 

Despite the increasing number and diversity of 
HCV sequences, the system of classification of variants 
into genotypes and subtypes has proven surprisingly 
robust. The seven confirmed genotypes have strong 
bootstrap support (Fig. 1), and the partition of these 
genotypes into subtypes that differ over a complete 
coding region sequence by >15% reflects a natural 
hiatus in the distribution of sequence distances 
(Fig. 2). We welcome any comments or suggestions for 
the proposed classification guidelines. Areas of uncer- 
tainty remain with respect to the region of endemicity 
of genotype 5, represented by a single subtype 
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Table 3. Remaining Provisionally Assigned HCV Subtypes 



Accession number(s)* 





Isolate 


Core/El 


NSSB 


Reference(s) 


Genotype 1 










Id 


HC1-N15, HC1-N16 


L39299, L39302 


L38377, L38372 


66 


If 


FR2 


L38350 


L38371 


66 


li 


FR16, QC77 


n.a., AY434119 


L48495, AY434120 


67,68 


lj 


QC2, QC89 


AY434158, AY434128 


AY434106, AY434129 


67 


lk 


QC68, QC82 


AY434112, AY434122 


AY434113, AY434123 


67 


Genotype 2 










2f 


JK081, JK139 


D49754, D49757 


D49769, D49777 


47 


2g 


MED017 


n.a. 


X93323 


69 


2h 


MED007 


n.a. 


X93327 


69 


21 


FR15 


n.a. 


L48494 


68 


2n 


NL50 


L39309 


L44602 


66 


2o 


FR4 


L38333 


L38373 


66 


2p 


NL33 


L39300 


L44601 


66 


Genotype 3 










3c 


NE048 


D16612 


D14198/D16613 


70 


3d 


NE274 


D16620 


D14200/D16621 


70 


3e 


NE145 


D16618 


D16619 


70 


3f 


NE125, PK64 


D16614, n.a. 


D14203/D16615, L78842 


70,71 


Genotype 4 










4e 


CAM600, GB809 


L29589, L29629 


L29590, L29626 


72 


4h 


GB438, FrSSD35 


L29610, n.a. 


L29611, AJ291249 


72,73 


4i 


CAR4/1205 


L36439 


L36437 


74 


4j 


CAR1/501 


n.a. 


L36438 


74 



♦Accession numbers of sequences from the core/El and NS5B regions, "n.a.": not available; "/": denotes that the core/El or NS5B sequences are available 
from two different accession numbers. 
t Examples of each provisionally assigned HCV. 



isolated in Europe, Brazil, North Africa, and South 
Africa, and genotype 7, isolated from an emigrant 
from the Congo. We might also anticipate the further 
discovery of other HCV-like viruses in the genus 
Hepacivirus 25 ' 28 and variants closer genetically to 
HCV than the nonprimate hepacivirus that appears to 
be an endemic infection of horses worldwide. 25 As 



Table 4. Recombinant (RF) HCV 
Complete Coding Region Sequences 



RF* 


Breakpoint 


Accession 


Isolates* 


Reference 


RF2k/lb 


3186 


AY587845 


33 


75-77 


RF2i/6p 


3405-3464 


DQ155560 


1 


41 


RF2b/lb_l 


3456 


DQ364460 


1 


78 


RF2/5 


3366-3389 


AM408911 


1 


79 


RF2b/6w 


3429 


EU643835 


1 


64 


RF2b/lb_2 


3432 


AB622121 


1 


80 


RF2b/la 


3429-3440 


JF779679 


1 


81 


RF2b/lb_3 


3286-3293 


AB677530 


1 


82 


RF2b/lb_4 


3286-3293 


AB677527 


1 


82 



♦Recombinant forms (RF) for which complete genome sequences are avail- 
able are named according to the subtypes from which they are derived and in 
the order in which these appear in the genome. 

breakpoints are numbered with reference to H77 (AF009606). 

*Number of individuals from whom the RF has been isolated. 



more is learned about the host-specificity and diversity 
of hepaciviruses, the genotype classification of HCV 
may be logically incorporated within a unified classifi- 
cation of hepaciviruses at the species and potentially 
subspecies and subgenus levels. 

Acknowledgment: We thank Professor Ling Lu of the 
University of Kansas Medical Centre for providing several 
genotype 1 sequences prior to their release on GenBank. 

Note added in proofs: A recent paper by Jordier et al. 
(J. Med Virol 2013;85:1754-1764) describes sequences 
that allow the confirmation of subtypes 21 
(KC197235, KC197240) and 2t (KC197238) and 
identifies three unassigned subtypes of genotype 2 
(KC197236, KC197237, KC197239). 
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